Minimum Classification Error Based Optimal Linear Combination for Spoken Language Identification

نویسندگان

  • Donglai Zhu
  • Rong Tong
  • Bin Ma
  • Haizhou Li
چکیده

State-of-the-art language identification systems are commonly constructed with multiple parallel classifiers to take advantage of different levels of speech features. These classifiers are combined with a fusion module to make the final decision. Following the maximum a posteriori (MAP) decision rule, the fusion of multiple classifiers can be transformed to a constrained optimal linear combination (OLC), where the linear weights represent the prior probabilities of the classifiers. We derive an optimization algorithm for the constrained linear weights based on the minimum classification error (MCE) criterion. The proposed method is evaluated on the NIST 2003 LRE task. Compared with the best individual classifier, the fusion approach reduces the equal error rate (EER) at relative rates of 10.2%, 24.4% and 25.7% for 30-second, 10-second and 3-second durations of speech segments, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Minimum classification error training of hidden Markov models for acoustic language identification

The goal of acoustic Language Identification (LID) is to identify the language of spoken utterances. The described system is based on parallel Hidden Markov Model (HMM) phoneme recognizers. The standard approach for parameter learning of Hidden Markov Model parameters is Maximum Likelihood (ML) estimation which is not directly related to the classification error rate. Based on the Minimum Class...

متن کامل

مقایسه روش های طیفی برای شناسایی زبان گفتاری

Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...

متن کامل

Discriminative model combination

Discriminative model combination is a new approach in the field of automatic speech recognition, which aims at an optimal integration of all given (acoustic and language) models into one log-linear posterior probability distribution. As opposed to the maximum entropy approach, the coefficients of the log-linear combination are optimized on training samples using discriminative methods to obtain...

متن کامل

Discriminative Training and Support V Language Call Ro

In natural language call routing, callers are routed to desired departments based on natural spoken responses to an open-ended “How may I direct your call?” prompt. Natural language call classification can be performed using support vector machines (SVMs) or the popular vector-based model used in information retrieval. We recently demonstrate how discriminative training is powerful to improve a...

متن کامل

Studying Effectiveness of Landsat ETM+ Satellite Images Classification Methods in Identification of desert pavements (Case study: South of Semnan)

Extended abstract 1- Introduction The process of identifying landforms is a subject that has been researched by many researchers. All the definitions of geomorphology emphasize the study and identification of landforms. Understanding landforms and how they are distributed are some sort of essential requirements in applied geomorphology and other environmental sciences (Shayan et al., 2012). O...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006